⚡ SIMD Optimization
AVX-512, Vectorization, Loop Unrolling, Auto-vectorization
Optimizing Recommendation Systems with JDK’s Vector API
netflixtechblog.com · 7h · Discuss: Hacker News, r/programming
🔄 SIMD Programming
Show HN: A compression tool that beats xz on x86_64 ELFs by 6%
news.ycombinator.com · 20h · Discuss: Hacker News
🗜️ Vector Compression
Zig `hexagon-linux-none` target cross compilation
ziggit.dev · 2h
📄 File Formats
Disaggregated Prefill and Decode
research.perplexity.ai · 12h
💾 Prompt Caching
KlongPy: Automatic Differentiation
klongpy.org · 1h · Discuss: Hacker News
🕯️ Candle
Determining Factorial Speed Fast
arxiv.org · 1d
🔍 Binary Analysis
New Zlib-rs Delivers More Performance With AVX-512 VNNI Adler32 Implementation
phoronix.com · 20h
⚡ Zero-Copy Serialization
How do i get the best speed out of Qwen 3.5 9B in 16GB VRAM?
github.com · 5h · Discuss: r/LocalLLaMA
🔬 RaBitQ
Deep Dive: How StarRocks Built a High-Performance Vectorized Engine
starrocks.io · 6d
⚡ Vectorized Execution
Programming in K
news.ycombinator.com · 1d · Discuss: Hacker News
🔢 Algebraic Data Types
The Distribution of Ridgeless Least Squares Interpolators
jmlr.org · 21h
🧠 LLM Inference
μpack: Faster & more flexible integer compression
blog.cf8.gg · 4d · Discuss: r/programming, r/rust
🔬 RaBitQ
WarpSpeed automatically rewrites Nvidia core library, achieves 3.6-100x speedup
doubleai.com · 16h · Discuss: Hacker News
📊 Model Serving Economics
Lower Latency and Higher Throughput with Multi-node DeepSeek Deployment
research.perplexity.ai · 12h
🏗️ LLM Infrastructure
A GPU Microarchitecture Optimized for Fully Homomorphic Encryption
semiengineering.com · 2d
⚡ Hardware Acceleration
Exposing More Parallelism Is the Hidden Reason Why Some Vectorized Loops Are Faster - Not Vectorization per se
johnnysswlab.com · 4d · Discuss: Hacker News
⚡ SIMD
Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation
arxiv.org · 1d
📱 Edge AI Optimization
Rare Huawei-ByteDance alliance unveils RRAM AI chip delivering 66x CPU speed at ISSCC 2026
digitimes.com · 16h
🖥️ Hardware Architecture
Why Structured Kernels?
modular.com · 4d
⚡ Hardware Acceleration
A Number with a Shadow
campedersen.com · 1d
🕯️ Candle